Interactive Text Mining Suite: Data Visualization for Literary Studies

Authors

  • Olga Scrivner
  • Jefferson Davis
Abstract

In recent years, there has been growing interest in visualization methods for literary text analysis. While text mining and visualization tools have become mainstream research methods in many fields (e.g. social sciences, machine learning), their application to literary studies remains infrequent. In addition to technological challenges, the use of these tools requires a methodological shift from traditional close reading to distant reading approaches. This transition also aligns digital humanities with corpus linguistics, which still “remains obscure” and is not fully embraced by digital humanists [16]. To address some of these challenges, we introduce Interactive Text Mining Suite, a user-friendly toolkit developed for both digital humanists and corpus linguists. We further demonstrate that the integration of visual analytics and corpus linguistics methods helps unveil language patterns otherwise hidden from the human eye. Making use of both linguistically annotated data and natural language processing techniques, we are able to discern patterns of part-of-speech use in the Medieval Occitan manuscript Romance de Flamenca and its English translation. Furthermore, visual analysis detects stylistic differences not only at the word level but also at the sentence and document levels. While preserving traditional close reading techniques, this toolkit makes it possible to apply interactive control over documents, thus allowing for a “synthesis of computational and humanistic modes of inquiry” [18].

Similar resources

VerseVis: Visualization of Spoken Features in Poetry

The exploration and analysis of literary corpora is a difficult task. Previous approaches to this problem focused on mining data directly from text. However, these solutions do not aid researchers who are interested in learning spoken features of the text, which play an important role in poetic works. VerseVis is a text visualization tool that gives users the ability to identify interesting tex...

Design and Test of the Real-time Text mining dashboard for Twitter

One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in datasets that are currently being produced at high speed, in large volumes, and in a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing this huge amount of data has today become one of the concerns of data science scholar...

Palimpsest: Improving assisted curation of loco-specific literature

This paper reports on interdisciplinary work carried out for the Palimpsest project, focusing on mining literary works set in Edinburgh, a UNESCO City of Literature. The project's aim is to use text mining to scour accessible literary works and find those mentioning Edinburgh or places within it. We ground "loco-specific" passages of text by identifying their latitudes and l...

Information Extraction and Interactive Visualization of Road Accident Related News

This paper describes a strategy for extracting information from raw data and visualizing it in a web browser. The raw data are collected from newspapers and are in English. A text mining process extracts specific information, and this process is explained in detail. The derived information concerns road accident news specifically, whereas the raw data contain all kinds of news. ...

Information Visualization with Text Data Mining for Knowledge Discovery Tools in Bioinformatics

An abundant amount of information is produced in the digital domain, and an effective information extraction (IE) system is required to surf through this sea of information. In this paper, we show that an interactive visualization system works effectively to complement an IE system. In particular, three-dimensional (3D) visualization can turn a data-centric system into a user-centric one by fac...

Publication date: 2017